On a problem of Erdos and Rothschild on edges in triangles 
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£Sj . Abstract 

Erdos and Rothschild asked to estimate the maximum number, denoted by /i(n, c), such that 
every n- vertex graph with at least cn 2 edges, each of which is contained in at least one triangle, 
must contain an edge that is in at least h(n, c) triangles. In particular, Erdos asked in 1987 
to determine whether for every c > there is e > such that h(n, c) > n e for all sufficiently 
large n. We prove that h(n,c) — rP^I lo g lo s™) for every fixed c < 1/4. This gives a negative 
answer to the question of Erdos, and is best possible in terms of the range for c, as it is known 
that every n- vertex graph with more than n 2 /A edges contains an edge that is in at least n/6 
, triangles. 

1 Introduction 

A book of size h in a graph is a collection of h triangles that share a common edge. The booksize of 
a graph G is the size of the largest book in G. The study of books in graphs was started by Erdos 
■ [5] in 1962, and has since attracted a great deal of attention in extremal graph theory (see, e.g., 

[21 EH H3]) and graph Ramsey theory (see, e.g., pH O H3 QH 13 HSJ [20] ) . 

Erdos and Rothschild [6] initiated the study of the booksize of graphs with the property that 
every edge is in a triangle. Let h(n, c) be the largest integer such that every n-vertex graph with 
at least cn 2 edges, each of which is contained in at least one triangle, must contain an edge that is 
in at least h(n, c) triangles. Erdos and Rothschild asked to estimate h(n, c) for fixed c > 0. This 
^ question has received considerable attention (see, e.g., the Erdos problem papers [HEIE], and the 

& I book |31). 

Using his regularity lemma, Szemeredi proved that for every c > 0, h(n,c) — > oo as n — > oo. 
This fact has a number of applications to various problems in extremal combinatorics. Ruzsa 
and Szemeredi |19j showed that the statement h(n, c) > 1 for every fixed c > and sufficiently 
large n implies Roth's theorem: that every subset of the first n positive integers without a 3-term 
arithmetic progression has size o(n). They also showed that it is equivalent to the (6, 3)-theorem: 
that every 3-uniform hypergraph on n vertices in which the union of any 3 edges contains more 
than 6 vertices has o(n 2 ) edges. In the other direction, Alon and Trotter (see [8]) proved that for 
each c < 1/4 there is d > such that h(n,c) < dy/ri. The condition c < 1/4 is best possible, 
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because independent results of Edwards [4J and Khadziivanov and Nikiforov [13] state that any 
n- vertex graph with more than n 2 /4 edges contains an edge in at least n/6 triangles. In particular, 
this implies for c > 1/4, we must have h(n, c) > n/6. 

For over two decades, there was no improvement on the 0(y/n) upper bound for any fixed 
c < 1/4. Indeed, Erdos even proposed that perhaps the lower bound should be improved to a 
power of n. Specifically, in 1987 he asked in [6] whether there is a constant e > such that 
h(n, c) > n € for every fixed c > and all sufficiently large n. This question was also featured in the 
book Erdos on Graphs [3]. We give a negative answer to this question. In fact, Theorem 1 1 . 1 1 b elow 
implies that h(n,c) = for every fixed c < 1/4. By the above remark that h(n,c) > n/6 for 
c > 1/4, this gives a best possible range for c with this bound and shows that a sharp transition 
occurs when c is near 1/4. 

Theorem 1.1. For all sufficiently large n, there are n-vertex graphs with ^ (l — e - ( logn ) 1/6 ) edges, 
with the property that every edge is in a triangle, but no edge is in more than n 14 / loglogri trianglesa 

The study of h(n,c) with c near 1/4 began in the problem papers of Erdos [8]. Let / be 
such that cn 2 = n 2 /4 — f(n)n. Erdos [7] proved if / is constant, then /i(n, c) = Q(n). Bollobas and 
Nikiforov [2j further showed that h(n,c) is asymptotically n/6 if / — > 0. If / tends to infinity with 
n, but not too quickly, so that f(n) < n 2 / 5 , they showed that /i(n, c) is asymptotically ^— ^==. 

Note that Theorem 11.11 shows that this behavior cannot continue when /(n) approaches linearity 
in n. In fact, similar constructions, which we omit, show that there are positive absolute constants 
a, e such that h(n, c) = (9(n 1//2_<E ) where /(n) = n 1_a . This shows that the asymptotic behavior of 
/i(n, c) discovered by Bollobas and Nikiforov with c very near 1/4 already breaks down when /(n) 
is some power of n which is less than 1. 

We close the introduction by discussing lower bounds on h(n, c) for fixed c > 0. The fact 
that h(n, c) tends to infinity follows from the triangle removal lemma, which is a consequence of 
Szemeredi's regularity lemma. The triangle removal lemma states that for each fixed e > there is 
5 > such that every graph on n vertices with at most <5n 3 triangles can be made triangle-free by 
removing at most en 2 edges. Suppose G is an n-vertex graph with cn 2 edges, each of which is in at 
least one and at most h = h(n, c) triangles. The total number of triangles in G is at most hcn 2 /3. 
Thus, if S > then there are en 2 edges of G such that every triangle of G contains at least one 
of these edges. Since every edge of G is in at least one triangle, there are at least cn 2 /3 triangles 
in G, and hence there is an edge in at least (cn 2 /3)/(en 2 ) = ^ triangles. As no edge is in more 
than h triangles, this implies ^ < h. The regularity proof gives a bound for <5 -1 in the triangle 
removal lemma which is a tower of twos of height a power of e . Together with the above bounds 
on /i(n, c), this implies h(n, c) is at least a power of the iterated logarithm log* n. Recently, the first 
author [12] gave a new proof of the triangle removal lemma which avoids Szemeredi's regularity 
lemma and gives a better bound. Namely, in the triangle removal lemma, we can take 5~ l to be a 
tower of twos of height logarithmic in e . This gives a lower bound for h(n, c) which is exponential 
in log* n. 

1 A\1 logarithms in this paper are in base e « 2.718. 
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2 Tools 



The properties of our construction are essentially derived from the concentration of measure. Say 
that a random variable X(u) on an n-dimensional product space f2 = Y\7=i * s C-Lipschitz if 
changing u in any single coordinate affects the value of X{uj) by at most C. The Hoeffding-Azuma 
inequality (see, e.g., pQ) provides concentration for these distributions. 

Theorem 2.1 (Hoeffding-Azuma Inequality). Let X be a C-Lipschitz random variable on an n- 
dimensional product space. Then for any t > 0, 

P[|A-E[A]|>t]<2exp|-^- 

We also need the following well-known formula for the volume of a high-dimensional Euclidean 
ball. The formula is slightly different for even and odd dimensions. Since our analysis is asymptotic 
in nature, it suffices to consider only even dimensions (which yield simpler forms). 

Theorem 2.2. For a positive even integer d and a positive real number r, the volume of Br , the 



d-dimensional Euclidean ball with radius r, is 



- 7r d/2„d 



(d/2)\ ■ 

The following weaker estimate turns out to be more convenient for our analysis. 
Corollary 2.3. For a positive even integer d and a positive real number r, 



VoI(bW) < (2vre) d / 2 - 



d d / 2 ' 

The desired bound in the corollary follows from the standard estimate d\ > (|) d , which is 
routinely obtained by bounding log(d!) = Ym=i > jf^ogxdx. 



3 Construction 

We first describe a graph which almost has the desired properties. Specifically, no edge will be 
in many triangles, and the number of edges will be quadratic in the number of vertices, but some 
edges may fail to be in triangles. Throughout this section, we will write x = y±5orxisiny±<5 
to denote y — 5 < x < y + 5. 

Pre-Construction. For a positive even integer r, let d = r 5 , let n = r d , and let \x = r2 f T 1 • d. 
Consider the tripartite graph with vertex set Au B U C , where each of A and B are copies of [r] d , 
and C = {0, 1, . . . , r + l} d . Vertices a G A and b £ B are joined by an edge if and only if (when 
considered as lattice points in [r] d ) their distance satisfies ||o — 6||| = /x±d. Similarly, vertices b € B 
and c G C are adjacent if and only if \\b — c||| = 4 ± 2d. Finally, c G C and a G A are adjacent if 
and only if ||c — a||| = 4 ± 2d. 

The following lemma will help us to show that the bipartite graph between A and B is nearly 
complete. 
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Lemma 3.1. Let r and d be given integers, and let U and V be two lattice points sampled inde- 
pendently and uniformly at random from [r] d . Define 

r 2 - 1 , 

a = • d . 

P 6 

_ d 

Then with probability at least 1 — 2e 2^ , \\U — V\\2 = ± d. 

Proof. Let U = (U\, . . . , U r ) and V = (Vi, . ■ ■ , V r ). The squared L2 distance is precisely ^2i(U — 
Vi) 2 , which is a sum of r independent random variables. A simple calculation shows that 

E [(Ui - Vx) 2 ] = E [Vf] - 2E [Ui] E [Vx] + E [V 2 ] = 2 (e [uf] - E [Ui] 2 

Since U± is an integer picked uniformly at random from [r], then E [Ui] = — ijp while 



E [U[ 



2 

1 r(r + l)(2r + l) (r + l)(2r + l) 



so 

^.2 



E [(U x - Vxf 



6 ' 
and hence 

E[\\u-v\\ 2 ] = r —^-d. 

On the other hand, each (Ui — Vi) 2 is less than r 2 , so by the Hoeffding-Azuma inequality (Theorem 
I2.ip . the probability that \\U — V\\ 2 deviates from its expectation by more than d is at most 

d 2 _ d 

2e zr^d = 2e 2^ , 

as claimed. □ 

Next, we show that every edge between A and B is in a positive number of triangles, but not 
too many. 

Lemma 3.2. In the Pre- Construction, the number of edges that join A and B is at least (1 — 
2e~2^)n 2 , and every one of those edges is contained in between 2 d ~ 1 and 15 rf triangles. 

Proof. The first claim is an immediate consequence of the previous lemma. We then move to 
establish a lower bound on the number of triangles that contain a given edge ab. By definition, we 
have \\a — 6|| | = \i ± d. Let m = (mi, . . . , m^) denote the midpoint of a = (ai, . . . , aj) and b = 
(61, ... , bd) when considered as points in [r] d . Note that although a and b have integer coordinates, 
m may have half-integer coordinates. Let x% = hi — ai\ then mi — 04 = For each i, if x% is odd, 
define 6{ = ^, and if X{ is even, define <5j = 1. Consider lattice points c of the form cj = ra, + 5iei, 
where e« G {±1}- All such points still lie in C because C = {0, . . . , r + l} d . Then, 

lie - a\\ 2 2 = (y + (JiCt) = — 4 °^ 2 + E ^ + E x ^ e < 



|2 



2 116- alii 



c|| 2 = S (y ~ ^ e v = — + £ ^ ~ £ Xi<5i£i 
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Since ||6 — <3,||1 = A* ± <i and J2i $i — d, every choice of (ej) satisfying 



will produce a point c G C which is permissible as the third vertex of a triangle containing ah. 
(It would make \\c — a||| and ||6 — c||| both in § ± 2d.) Now consider the e« as independent 
uniform random variables over {±1}, and define the random variable Z = £E x i$i e i- By symmetry, 
E [Z] = 0, and since \xi\ < r, changing the choice of a particular ej cannot affect Z by more than 
2r. Therefore, the Hoeffding-Azuma inequality (Theorem 12, 1|) gives 



\Z\ > - A d 
4 



< 2 exp 



w 



2(2r) 2 d 



<2e 



which implies that the number of valid c is at least 



1 _ 2e~TS^ | • 2 d > 2 d ~ x 



as claimed. 

For the upper bound, again assume that we are given a, b such that \\a 

Xj = b 



fj, ± d, and let 



cij. We will bound the number of half-lattice points c of the form a = + ^ + ^ j where 



u;j £ Z, which satisfy 



£ ±2d and ||6 



^ ± 2d For this, observe that 



E 
E 



+ 



y 



2 
2 



a lli 



+ zE-* + 2 



^E 

2 ^ 

IE 



X^W{ 



so we always have 



+ 116 



It, 



W: 



2 2 

i 

Hence whenever both ||c — a\\\ and ||6 — c||| are in 4 ± 2(f, we also have X^ii'f < 9d. It therefore 
suffices to bound the number of lattice points in B^\^, the d-dimensional Euclidean ball of radius 

3y/d centered at the origin. Observe that this is at most the volume of £r d ^ because by placing 
a unit d-dimensional cube centered at each lattice point in B^r^, we obtain a non-overlapping 
collection of unit cubes all contained in the ball of radius 3y/d + \yd by the triangle inequality 
(the greatest distance from the center of a unit cube to a point on its boundary is \^fd). 

Yet Corollary 12.31 bounds the volume of the (i-dimensional Euclidean ball of radius 3.5\/d by 



(2vre) 



d/2 _ 



LbVdY 

d d / 2 



< 15° 



as claimed. 



□ 
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Lemma 3.3. In the Pre- Construction, every edge joining B and C, or joining A and C, is con- 
tained in at most 15 d triangles. 

Proof. Assume that we are given a, c such that ||c — a[|| = 4 =t 2c?, and let = q — a^. We will 
bound the number of lattice points b of the form b% = at + 2yi + Wi, where Wi S Z, which satisfy 
11^ ~~ c lli = f =t 2d and \\b — a||| = fi±d. For this, observe that 

II 6 - c \\l = ^2(Vi + w if = \\ c ~ a \\l + ^2 w i + 2 ^2 ViWi 
i i i 

\\ b ~ a \\l = ^2( 2 Ui + w if = 4 II C - alii + ^2 W 1 + 4:^yiWi , 



and hence 



|6-o|||-2||6-c||| = 2||c-a|||-^ 



Therefore, the only way to have both \\b — c||| = j ± 2d and ||6 — a||| = /U ± d is to also have 
X)j w, ? < 9d. By the same computation as in the proof of the previous lemma, the number of such 
integral (u>i) is less than 15 rf . Hence, every edge between A and C is in at most 15 d triangles. By 
symmetry, every edge between B and C also is in at most 15 d triangles. □ 

We are now ready to prove the main theorem. 

Proof of Theorem 11.11 Start with the Pre-Construction for a (sufficiently large) even integer 
r, with d = r 5 and n = r d = \A\ = \B\. Note that n = d d ^ 5 , so d = (1 + 0(1)) ^°^ . We will 
take a random subgraph by sparsifying C. Let C C C with \C'\ = 2~ d l 2 \C\ be picked uniformly 
at random. 

Next, consider an edge ah joining A and B. By Lemma 13.21 in the Pre-Construction the edge 
ab was in at least 2 d ~ 1 triangles with vertices in C. Let E a b be the event that the edge ab is not 
in a triangle with a vertex from C' . This happens precisely when none of the > 2 d ~ 1 vertices in C 
that form a triangle with ab are in C' . Hence, 

2 d-lN|C|/2^ 
< I <e- 2 *-\ 




ici ; 

and the expected number of edges ab for which E a \, occurs is at most 

d 1 d 1 

|A| |i>|e = n e 

d _ 1 

Fix a choice of C with at most n 2 e~ 2 ^ edges a& satisfying E 1 ^. Consider the subgraph induced 
by A U B U C". The total number of vertices in the graph is only 

N = 2n + \C'\ = (2 + 2- d / 2 ( r -±^-) d )n< (2 + 2~f • e™) n < (2 + 2~f ) n. (1) 



Unfortunately, now some edges are no longer in triangles. We resolve this by deleting all such 
edges. By Lemma 13.21 with r = d 1 / 5 , the number of edges between A and B was originally at least 
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n 2 (l — 2e~ 2 dl/j ) , so since we chose C such that at most n 2 e~ 21 edges ab are not in triangles, the 
number of remaining edges between A and B after deleting those not in triangles is still at least 
n 2 (l — 3e~z d /o ). Therefore, by ([1]), the number of remaining edges between A and B is at least 

• V ' 2 'l - 3e-^ 1/5 ) >^(l- > - ( 1 - e~**W 



.|V V / 4 \ J A 



so the remaining graph has the claimed total number of edges. Finally, note that our deletions 
cannot create any new triangles, so by Lemmas 13.21 and 13.31 every edge is still in at most 

15 d = 15 (1+o(1))tS^T < ^WloglogTV 

triangles, completing our proof. □ 

Remark 1. The use of randomness to pick C in the above construction is not necessary. Indeed, 
the construction can be made explicit by instead picking C greedily so that each new vertex added 
to C (locally) maximizes the number of edges between A and B that are in triangles with vertices 
from C . 

Remark 2. After publicizing this result, the authors received the following nice observation from 
Noga Alon. The objective of sparsifying C to C was to raise the edge density to approach 1/4. A 
simpler way to increase the density is to leave C alone, and instead replace each vertex of A U B 
with exactly 2 d copies of itself, joining two copies of (different) vertices by an edge if their original 
vertices were initially adjacent, and joining a copy of a vertex in Au B to an uncopied vertex c € C 
if the corresponding original vertex of A U B was adjacent to c. This avoids our final probabilistic 
arguments altogether, and allows for the further simplification that in the Pre-Construction, all of 
A,B,C can be taken to be [r] d . Then, it suffices to replace the lower bound in Lemma 13.21 with 
the observation that for any edge ab between A and B, the integer-rounded midpoint produces at 
least one point c which completes ab to a triangle. 
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